Overview
Dataset statistics
| Number of variables | 26 |
|---|---|
| Number of observations | 1000000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 9569 |
| Duplicate rows (%) | 1.0% |
| Total size in memory | 158.3 MiB |
| Average record size in memory | 166.0 B |
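The derived rows in the table above follow from the raw totals. The sketch below recomputes the duplicate-row percentage and the average record size from the reported figures. It assumes that 1 MiB is 2^20 bytes and that values are rounded to one decimal place, neither of which the report states; the function names are illustrative.

```python
# Reported values copied from the Dataset statistics table.
N_ROWS = 1_000_000
N_DUPLICATES = 9_569
TOTAL_MIB = 158.3

# Assumption: 1 MiB = 2**20 bytes.
BYTES_PER_MIB = 2 ** 20


def duplicate_pct(duplicates: int, rows: int) -> float:
    """Share of duplicate rows, in percent."""
    return 100.0 * duplicates / rows


def avg_record_bytes(total_mib: float, rows: int) -> float:
    """Average in-memory size of one record, in bytes."""
    return total_mib * BYTES_PER_MIB / rows


print(f"{duplicate_pct(N_DUPLICATES, N_ROWS):.1f}%")      # 1.0%
print(f"{avg_record_bytes(TOTAL_MIB, N_ROWS):.1f} B")     # 166.0 B
```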
Variable types
| Numeric | 10 |
|---|---|
| Categorical | 16 |
Alerts
| Dataset has 9569 (1.0%) duplicate rows | Duplicates |
|---|---|
NU_DESEMPENHO is highly overall correlated with NU_MEDIA_GERAL and 10 other fields | High correlation |
NU_INFRAESTRUTURA is highly overall correlated with Q006 | High correlation |
NU_MEDIA_GERAL is highly overall correlated with NU_DESEMPENHO and 10 other fields | High correlation |
NU_NOTA_CH is highly overall correlated with NU_DESEMPENHO and 10 other fields | High correlation |
NU_NOTA_CN is highly overall correlated with NU_DESEMPENHO and 10 other fields | High correlation |
NU_NOTA_LC is highly overall correlated with NU_DESEMPENHO and 10 other fields | High correlation |
NU_NOTA_MT is highly overall correlated with NU_DESEMPENHO and 10 other fields | High correlation |
NU_NOTA_REDACAO is highly overall correlated with NU_DESEMPENHO and 10 other fields | High correlation |
Q006 is highly overall correlated with NU_INFRAESTRUTURA | High correlation |
TP_ANO_CONCLUIU is highly overall correlated with TP_FAIXA_ETARIA | High correlation |
TP_FAIXA_ETARIA is highly overall correlated with TP_ANO_CONCLUIU and 1 other field | High correlation |
TP_PRESENCA_CH is highly overall correlated with NU_DESEMPENHO and 10 other fields | High correlation |
TP_PRESENCA_CN is highly overall correlated with NU_DESEMPENHO and 10 other fields | High correlation |
TP_PRESENCA_GERAL is highly overall correlated with NU_DESEMPENHO and 10 other fields | High correlation |
TP_PRESENCA_LC is highly overall correlated with NU_DESEMPENHO and 10 other fields | High correlation |
TP_PRESENCA_MT is highly overall correlated with NU_DESEMPENHO and 10 other fields | High correlation |
TP_ST_CONCLUSAO is highly overall correlated with TP_FAIXA_ETARIA | High correlation |
TP_ESTADO_CIVIL is highly imbalanced (70.8%) | Imbalance |
TP_DEPENDENCIA_ADM_ESC is highly imbalanced (53.6%) | Imbalance |
Q025 is highly imbalanced (54.7%) | Imbalance |
TP_COR_RACA has 13385 (1.3%) zeros | Zeros |
NU_NOTA_REDACAO has 29787 (3.0%) zeros | Zeros |
TP_ANO_CONCLUIU has 569476 (56.9%) zeros | Zeros |
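The Imbalance percentages reported above are consistent with a normalized-entropy score, 1 - H / log(k), where H is the Shannon entropy of the class frequencies and k is the number of distinct values. The sketch below is an assumption about how such a score can be derived, not a copy of the profiling library's code. `imbalance_score` is a hypothetical helper, and the counts are the TP_ESTADO_CIVIL and TP_DEPENDENCIA_ADM_ESC values listed in this report.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log(k): 0 for a uniform split, close to 1 when one class dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # convention chosen here for the degenerate single-class case
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log(p) for p in probs)
    return 1.0 - entropy / math.log(k)


# TP_ESTADO_CIVIL counts for values 1, 2, 0, 3, 4.
estado_civil = [887575, 50949, 43691, 16595, 1190]
# TP_DEPENDENCIA_ADM_ESC counts for values -1.0, 2.0, 4.0, 1.0, 3.0.
dependencia = [757052, 170030, 58139, 12410, 2369]

print(f"{100 * imbalance_score(estado_civil):.1f}%")
print(f"{100 * imbalance_score(dependencia):.1f}%")
```

Under this definition the two variables score roughly 0.708 and 0.536, which matches the 70.8% and 53.6% shown in the alerts.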
Reproduction
| Analysis started | 2025-04-20 13:00:14.583380 |
|---|---|
| Analysis finished | 2025-04-20 13:02:53.659626 |
| Duration | 2 minutes and 39.08 seconds |
| Software version | ydata-profiling v4.16.1 |
| Download configuration | config.json |
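The Duration row is the difference between the two timestamps above. A quick check with the standard library, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2025-04-20 13:00:14.583380", FMT)
finished = datetime.strptime("2025-04-20 13:02:53.659626", FMT)

# Elapsed wall-clock time in seconds, then split into minutes and seconds.
elapsed = (finished - started).total_seconds()
minutes, seconds = divmod(elapsed, 60)
print(f"{int(minutes)} minutes and {seconds:.2f} seconds")
```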
Variables
TP_FAIXA_ETARIA
Real number (ℝ)
High correlation 
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.097267 |
| Minimum | 1 |
|---|---|
| Maximum | 20 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 7 |
| 95-th percentile | 13 |
| Maximum | 20 |
| Range | 19 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 3.8733914 |
|---|---|
| Coefficient of variation (CV) | 0.75989573 |
| Kurtosis | 0.33366307 |
| Mean | 5.097267 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.1695409 |
| Sum | 5097267 |
| Variance | 15.003161 |
| Monotonicity | Not monotonic |
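Several rows in the statistics tables above are functions of other rows: the range is maximum minus minimum, the IQR is Q3 minus Q1, the variance is the squared standard deviation, and the coefficient of variation is the standard deviation divided by the mean. The sketch below re-derives them from the reported TP_FAIXA_ETARIA values as a sanity check. The dictionary keys are illustrative names, not fields of the report.

```python
import math

# Values copied from the TP_FAIXA_ETARIA statistics tables.
reported = {
    "min": 1, "max": 20, "q1": 2, "q3": 7,
    "mean": 5.097267, "std": 3.8733914,
    "range": 19, "iqr": 5, "variance": 15.003161, "cv": 0.75989573,
}

derived = {
    "range": reported["max"] - reported["min"],
    "iqr": reported["q3"] - reported["q1"],
    "variance": reported["std"] ** 2,
    "cv": reported["std"] / reported["mean"],
}

for name, value in derived.items():
    # The report prints about 8 significant digits, so compare loosely.
    ok = math.isclose(value, reported[name], rel_tol=1e-6)
    print(f"{name}: derived={value:.8g} reported={reported[name]} match={ok}")
```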
| Value | Count | Frequency (%) |
| 3 | 229615 | 23.0% |
| 2 | 191545 | 19.2% |
| 4 | 109620 | 11.0% |
| 1 | 88683 | 8.9% |
| 5 | 67861 | 6.8% |
| 11 | 62794 | 6.3% |
| 6 | 46631 | 4.7% |
| 7 | 35251 | 3.5% |
| 12 | 33905 | 3.4% |
| 8 | 28432 | 2.8% |
| Other values (10) | 105663 | 10.6% |
| Value | Count | Frequency (%) |
| 1 | 88683 | 8.9% |
| 2 | 191545 | 19.2% |
| 3 | 229615 | 23.0% |
| 4 | 109620 | 11.0% |
| 5 | 67861 | 6.8% |
| 6 | 46631 | 4.7% |
| 7 | 35251 | 3.5% |
| 8 | 28432 | 2.8% |
| 9 | 23165 | 2.3% |
| 10 | 18672 | 1.9% |
| Value | Count | Frequency (%) |
| 20 | 244 | < 0.1% |
| 19 | 550 | 0.1% |
| 18 | 1382 | 0.1% |
| 17 | 3441 | 0.3% |
| 16 | 6316 | 0.6% |
| 15 | 10393 | 1.0% |
| 14 | 16912 | 1.7% |
| 13 | 24588 | 2.5% |
| 12 | 33905 | 3.4% |
| 11 | 62794 | 6.3% |
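The Frequency (%) column in the tables above is the count divided by the number of observations (1000000), so any row's percentage can be recomputed from its count. The sketch below uses a formatting rule that is consistent with the rows shown in this report (one decimal place, with "< 0.1%" for shares that round to zero at three decimals). The rule and the helper name `fmt_frequency` are assumptions for illustration.

```python
N_OBSERVATIONS = 1_000_000


def fmt_frequency(count: int, total: int = N_OBSERVATIONS) -> str:
    """Format count/total as a percentage string with one decimal place."""
    share = count / total
    # Tiny but non-zero shares are shown as "< 0.1%" instead of "0.0%".
    if share > 0 and round(share, 3) == 0:
        return "< 0.1%"
    return f"{share * 100:.1f}%"


# TP_FAIXA_ETARIA values 3, 1 and 20.
for count in (229615, 88683, 244):
    print(count, fmt_frequency(count))
```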
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | F |
|---|---|
| 2nd row | F |
| 3rd row | M |
| 4th row | M |
| 5th row | F |
Common Values
| Value | Count | Frequency (%) |
| F | 612211 | 61.2% |
| M | 387789 | 38.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| f | 612211 | |
| m | 387789 |
Most occurring characters
| Value | Count | Frequency (%) |
| F | 612211 | |
| M | 387789 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1000000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| F | 612211 | |
| M | 387789 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1000000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| F | 612211 | |
| M | 387789 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1000000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| F | 612211 | |
| M | 387789 |
TP_ESTADO_CIVIL
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| 1 | 887575 |
|---|---|
| 2 | 50949 |
| 0 | 43691 |
| 3 | 16595 |
| 4 | 1190 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 887575 | 88.8% |
| 2 | 50949 | 5.1% |
| 0 | 43691 | 4.4% |
| 3 | 16595 | 1.7% |
| 4 | 1190 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 887575 | |
| 2 | 50949 | 5.1% |
| 0 | 43691 | 4.4% |
| 3 | 16595 | 1.7% |
| 4 | 1190 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 887575 | |
| 2 | 50949 | 5.1% |
| 0 | 43691 | 4.4% |
| 3 | 16595 | 1.7% |
| 4 | 1190 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1000000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 887575 | |
| 2 | 50949 | 5.1% |
| 0 | 43691 | 4.4% |
| 3 | 16595 | 1.7% |
| 4 | 1190 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1000000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 887575 | |
| 2 | 50949 | 5.1% |
| 0 | 43691 | 4.4% |
| 3 | 16595 | 1.7% |
| 4 | 1190 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1000000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 887575 | |
| 2 | 50949 | 5.1% |
| 0 | 43691 | 4.4% |
| 3 | 16595 | 1.7% |
| 4 | 1190 | 0.1% |
TP_COR_RACA
Real number (ℝ)
Zeros 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.058675 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 13385 |
| Zeros (%) | 1.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 3 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.0035982 |
|---|---|
| Coefficient of variation (CV) | 0.48749713 |
| Kurtosis | -1.2379797 |
| Mean | 2.058675 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.049803775 |
| Sum | 2058675 |
| Variance | 1.0072093 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 434023 | 43.4% |
| 1 | 400501 | 40.1% |
| 2 | 129253 | 12.9% |
| 4 | 16591 | 1.7% |
| 0 | 13385 | 1.3% |
| 5 | 6247 | 0.6% |
| Value | Count | Frequency (%) |
| 0 | 13385 | 1.3% |
| 1 | 400501 | 40.1% |
| 2 | 129253 | 12.9% |
| 3 | 434023 | 43.4% |
| 4 | 16591 | 1.7% |
| 5 | 6247 | 0.6% |
| Value | Count | Frequency (%) |
| 5 | 6247 | 0.6% |
| 4 | 16591 | 1.7% |
| 3 | 434023 | 43.4% |
| 2 | 129253 | 12.9% |
| 1 | 400501 | 40.1% |
| 0 | 13385 | 1.3% |
TP_DEPENDENCIA_ADM_ESC
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| -1.0 | 757052 |
|---|---|
| 2.0 | 170030 |
| 4.0 | 58139 |
| 1.0 | 12410 |
| 3.0 | 2369 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 3.757052 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2.0 |
|---|---|
| 2nd row | -1.0 |
| 3rd row | -1.0 |
| 4th row | -1.0 |
| 5th row | 2.0 |
Common Values
| Value | Count | Frequency (%) |
| -1.0 | 757052 | 75.7% |
| 2.0 | 170030 | 17.0% |
| 4.0 | 58139 | 5.8% |
| 1.0 | 12410 | 1.2% |
| 3.0 | 2369 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 769462 | |
| 2.0 | 170030 | 17.0% |
| 4.0 | 58139 | 5.8% |
| 3.0 | 2369 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 1000000 | |
| 0 | 1000000 | |
| 1 | 769462 | |
| - | 757052 | |
| 2 | 170030 | 4.5% |
| 4 | 58139 | 1.5% |
| 3 | 2369 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 3757052 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| . | 1000000 | |
| 0 | 1000000 | |
| 1 | 769462 | |
| - | 757052 | |
| 2 | 170030 | 4.5% |
| 4 | 58139 | 1.5% |
| 3 | 2369 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 3757052 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| . | 1000000 | |
| 0 | 1000000 | |
| 1 | 769462 | |
| - | 757052 | |
| 2 | 170030 | 4.5% |
| 4 | 58139 | 1.5% |
| 3 | 2369 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 3757052 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| . | 1000000 | |
| 0 | 1000000 | |
| 1 | 769462 | |
| - | 757052 | |
| 2 | 170030 | 4.5% |
| 4 | 58139 | 1.5% |
| 3 | 2369 | 0.1% |
TP_ST_CONCLUSAO
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| 1 | 482464 |
|---|---|
| 2 | 355120 |
| 3 | 157951 |
| 4 | 4465 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 3 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 482464 | 48.2% |
| 2 | 355120 | 35.5% |
| 3 | 157951 | 15.8% |
| 4 | 4465 | 0.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 482464 | |
| 2 | 355120 | |
| 3 | 157951 | 15.8% |
| 4 | 4465 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 482464 | |
| 2 | 355120 | |
| 3 | 157951 | 15.8% |
| 4 | 4465 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1000000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 482464 | |
| 2 | 355120 | |
| 3 | 157951 | 15.8% |
| 4 | 4465 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1000000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 482464 | |
| 2 | 355120 | |
| 3 | 157951 | 15.8% |
| 4 | 4465 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1000000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 482464 | |
| 2 | 355120 | |
| 3 | 157951 | 15.8% |
| 4 | 4465 | 0.4% |
SG_UF_PROVA
Categorical
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 977.9 KiB |
| SP | 150003 |
|---|---|
| MG | 91151 |
| BA | 82841 |
| RJ | 72249 |
| CE | 61938 |
| Other values (22) | 541818 |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | RJ |
|---|---|
| 2nd row | SP |
| 3rd row | MA |
| 4th row | PE |
| 5th row | PR |
Common Values
| Value | Count | Frequency (%) |
| SP | 150003 | 15.0% |
| MG | 91151 | 9.1% |
| BA | 82841 | 8.3% |
| RJ | 72249 | 7.2% |
| CE | 61938 | 6.2% |
| PA | 57908 | 5.8% |
| PE | 55541 | 5.6% |
| PR | 42307 | 4.2% |
| MA | 42094 | 4.2% |
| RS | 40299 | 4.0% |
| Other values (17) | 303669 | 30.4% |
Length
| Value | Count | Frequency (%) |
| sp | 150003 | |
| mg | 91151 | 9.1% |
| ba | 82841 | 8.3% |
| rj | 72249 | 7.2% |
| ce | 61938 | 6.2% |
| pa | 57908 | 5.8% |
| pe | 55541 | 5.6% |
| pr | 42307 | 4.2% |
| ma | 42094 | 4.2% |
| rs | 40299 | 4.0% |
| Other values (17) | 303669 |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 370060 | |
| S | 260654 | |
| A | 241040 | |
| R | 194480 | |
| M | 185562 | |
| E | 152588 | |
| G | 129050 | 6.5% |
| B | 114415 | 5.7% |
| C | 91310 | 4.6% |
| J | 72249 | 3.6% |
| Other values (7) | 188592 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2000000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| P | 370060 | |
| S | 260654 | |
| A | 241040 | |
| R | 194480 | |
| M | 185562 | |
| E | 152588 | |
| G | 129050 | 6.5% |
| B | 114415 | 5.7% |
| C | 91310 | 4.6% |
| J | 72249 | 3.6% |
| Other values (7) | 188592 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2000000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| P | 370060 | |
| S | 260654 | |
| A | 241040 | |
| R | 194480 | |
| M | 185562 | |
| E | 152588 | |
| G | 129050 | 6.5% |
| B | 114415 | 5.7% |
| C | 91310 | 4.6% |
| J | 72249 | 3.6% |
| Other values (7) | 188592 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2000000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| P | 370060 | |
| S | 260654 | |
| A | 241040 | |
| R | 194480 | |
| M | 185562 | |
| E | 152588 | |
| G | 129050 | 6.5% |
| B | 114415 | 5.7% |
| C | 91310 | 4.6% |
| J | 72249 | 3.6% |
| Other values (7) | 188592 |
TP_PRESENCA_CN
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| 1 | 684174 |
|---|---|
| 0 | 315276 |
| 2 | 550 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 684174 | 68.4% |
| 0 | 315276 | 31.5% |
| 2 | 550 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 684174 | |
| 0 | 315276 | |
| 2 | 550 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 684174 | |
| 0 | 315276 | |
| 2 | 550 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1000000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 684174 | |
| 0 | 315276 | |
| 2 | 550 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1000000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 684174 | |
| 0 | 315276 | |
| 2 | 550 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1000000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 684174 | |
| 0 | 315276 | |
| 2 | 550 | 0.1% |
TP_PRESENCA_CH
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| 1 | 717365 |
|---|---|
| 0 | 281489 |
| 2 | 1146 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 717365 | 71.7% |
| 0 | 281489 | 28.1% |
| 2 | 1146 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 717365 | |
| 0 | 281489 | 28.1% |
| 2 | 1146 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 717365 | |
| 0 | 281489 | 28.1% |
| 2 | 1146 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1000000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 717365 | |
| 0 | 281489 | 28.1% |
| 2 | 1146 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1000000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 717365 | |
| 0 | 281489 | 28.1% |
| 2 | 1146 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1000000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 717365 | |
| 0 | 281489 | 28.1% |
| 2 | 1146 | 0.1% |
TP_PRESENCA_LC
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| 1 | 717365 |
|---|---|
| 0 | 281489 |
| 2 | 1146 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 717365 | 71.7% |
| 0 | 281489 | 28.1% |
| 2 | 1146 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 717365 | |
| 0 | 281489 | 28.1% |
| 2 | 1146 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 717365 | |
| 0 | 281489 | 28.1% |
| 2 | 1146 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1000000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 717365 | |
| 0 | 281489 | 28.1% |
| 2 | 1146 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1000000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 717365 | |
| 0 | 281489 | 28.1% |
| 2 | 1146 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1000000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 717365 | |
| 0 | 281489 | 28.1% |
| 2 | 1146 | 0.1% |
TP_PRESENCA_MT
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| 1 | 684174 |
|---|---|
| 0 | 315276 |
| 2 | 550 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 684174 | 68.4% |
| 0 | 315276 | 31.5% |
| 2 | 550 | 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 684174 | |
| 0 | 315276 | |
| 2 | 550 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 684174 | |
| 0 | 315276 | |
| 2 | 550 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1000000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 684174 | |
| 0 | 315276 | |
| 2 | 550 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1000000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 684174 | |
| 0 | 315276 | |
| 2 | 550 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1000000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 684174 | |
| 0 | 315276 | |
| 2 | 550 | 0.1% |
NU_NOTA_CN
Real number (ℝ)
High correlation 
| Distinct | 1402 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 338.95541 |
| Minimum | -1 |
|---|---|
| Maximum | 868.5 |
| Zeros | 4111 |
| Zeros (%) | 0.4% |
| Negative | 315826 |
| Negative (%) | 31.6% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | -1 |
| Q1 | -1 |
| median | 445 |
| Q3 | 524 |
| 95-th percentile | 615.5 |
| Maximum | 868.5 |
| Range | 869.5 |
| Interquartile range (IQR) | 525 |
Descriptive statistics
| Standard deviation | 242.11099 |
|---|---|
| Coefficient of variation (CV) | 0.71428568 |
| Kurtosis | -1.3920365 |
| Mean | 338.95541 |
| Median Absolute Deviation (MAD) | 106 |
| Skewness | -0.53767043 |
| Sum | 3.3895541 × 10⁸ |
| Variance | 58617.733 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -1 | 315826 | 31.6% |
| 0 | 4111 | 0.4% |
| 529.5 | 1568 | 0.2% |
| 523.5 | 1559 | 0.2% |
| 521 | 1558 | 0.2% |
| 531 | 1557 | 0.2% |
| 513 | 1553 | 0.2% |
| 516.5 | 1547 | 0.2% |
| 522.5 | 1539 | 0.2% |
| 528 | 1539 | 0.2% |
| Other values (1392) | 667643 | 66.8% |
| Value | Count | Frequency (%) |
| -1 | 315826 | 31.6% |
| 0 | 4111 | 0.4% |
| 320.25 | 1 | < 0.1% |
| 321 | 1 | < 0.1% |
| 322.5 | 1 | < 0.1% |
| 323.25 | 38 | < 0.1% |
| 323.5 | 123 | < 0.1% |
| 323.75 | 74 | < 0.1% |
| 324 | 97 | < 0.1% |
| 324.25 | 45 | < 0.1% |
| Value | Count | Frequency (%) |
| 868.5 | 1 | < 0.1% |
| 856.5 | 1 | < 0.1% |
| 854.5 | 3 | < 0.1% |
| 854 | 8 | < 0.1% |
| 844.5 | 4 | < 0.1% |
| 843.5 | 5 | < 0.1% |
| 843 | 2 | < 0.1% |
| 842 | 2 | < 0.1% |
| 840.5 | 1 | < 0.1% |
| 839.5 | 1 | < 0.1% |
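The minimum of -1 together with the 315826 negative values suggests that -1 marks an absent score rather than a real one: that count equals the number of TP_PRESENCA_CN records with value 0 or 2 (315276 + 550). Under that assumption, which the report itself does not confirm, the mean over real scores can be recovered from the reported aggregates alone. `mean_without_sentinel` is a hypothetical helper.

```python
def mean_without_sentinel(mean_all, n_all, n_sentinel, sentinel=-1.0):
    """Mean of the non-sentinel values, given aggregates over all rows."""
    n_valid = n_all - n_sentinel
    if n_valid <= 0:
        raise ValueError("no non-sentinel values")
    total_all = mean_all * n_all
    # Remove the sentinel contribution from the overall sum.
    total_valid = total_all - sentinel * n_sentinel
    return total_valid / n_valid


# NU_NOTA_CN: mean 338.95541 over 1,000,000 rows, 315,826 of them equal to -1.
print(round(mean_without_sentinel(338.95541, 1_000_000, 315_826), 2))
```

With the reported figures this gives a mean of about 495.9 over the 684174 remaining scores, well above the 338.96 computed over all rows, so the summary statistics for the NU_NOTA_* variables should be read with the sentinel in mind.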
NU_NOTA_CH
Real number (ℝ)
High correlation 
| Distinct | 1432 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 375.11279 |
| Minimum | -1 |
|---|---|
| Maximum | 823 |
| Zeros | 1447 |
| Zeros (%) | 0.1% |
| Negative | 282635 |
| Negative (%) | 28.3% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | -1 |
| Q1 | -1 |
| median | 483.75 |
| Q3 | 562.5 |
| 95-th percentile | 644 |
| Maximum | 823 |
| Range | 824 |
| Interquartile range (IQR) | 563.5 |
Descriptive statistics
| Standard deviation | 247.71502 |
|---|---|
| Coefficient of variation (CV) | 0.66037477 |
| Kurtosis | -1.1996348 |
| Mean | 375.11279 |
| Median Absolute Deviation (MAD) | 104 |
| Skewness | -0.69057496 |
| Sum | 3.7511279 × 10⁸ |
| Variance | 61362.732 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -1 | 282635 | 28.3% |
| 538 | 1775 | 0.2% |
| 543.5 | 1763 | 0.2% |
| 540.5 | 1758 | 0.2% |
| 535 | 1758 | 0.2% |
| 538.5 | 1756 | 0.2% |
| 542.5 | 1752 | 0.2% |
| 541 | 1750 | 0.2% |
| 536.5 | 1744 | 0.2% |
| 547 | 1742 | 0.2% |
| Other values (1422) | 701567 | 70.2% |
| Value | Count | Frequency (%) |
| -1 | 282635 | 28.3% |
| 0 | 1447 | 0.1% |
| 290 | 1 | < 0.1% |
| 293.5 | 136 | < 0.1% |
| 293.75 | 17 | < 0.1% |
| 294 | 23 | < 0.1% |
| 294.25 | 16 | < 0.1% |
| 294.5 | 39 | < 0.1% |
| 294.75 | 12 | < 0.1% |
| 295 | 30 | < 0.1% |
| Value | Count | Frequency (%) |
| 823 | 16 | < 0.1% |
| 805 | 8 | < 0.1% |
| 804.5 | 18 | < 0.1% |
| 804 | 8 | < 0.1% |
| 800.5 | 18 | < 0.1% |
| 799.5 | 20 | < 0.1% |
| 798.5 | 3 | < 0.1% |
| 794.5 | 9 | < 0.1% |
| 794 | 1 | < 0.1% |
| 793.5 | 3 | < 0.1% |
NU_NOTA_LC
Real number (ℝ)
High correlation 
| Distinct | 1418 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 371.38743 |
| Minimum | -1 |
|---|---|
| Maximum | 821 |
| Zeros | 554 |
| Zeros (%) | 0.1% |
| Negative | 282635 |
| Negative (%) | 28.3% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | -1 |
| Q1 | -1 |
| median | 484 |
| Q3 | 550.5 |
| 95-th percentile | 622 |
| Maximum | 821 |
| Range | 822 |
| Interquartile range (IQR) | 551.5 |
Descriptive statistics
| Standard deviation | 242.31092 |
|---|---|
| Coefficient of variation (CV) | 0.65244781 |
| Kurtosis | -1.1718687 |
| Mean | 371.38743 |
| Median Absolute Deviation (MAD) | 88 |
| Skewness | -0.75203672 |
| Sum | 3.7138743 × 10⁸ |
| Variance | 58714.58 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -1 | 282635 | 28.3% |
| 527 | 2067 | 0.2% |
| 537 | 2044 | 0.2% |
| 524 | 2041 | 0.2% |
| 538.5 | 2033 | 0.2% |
| 526 | 2029 | 0.2% |
| 538 | 2018 | 0.2% |
| 535.5 | 2017 | 0.2% |
| 517 | 2011 | 0.2% |
| 514.5 | 2010 | 0.2% |
| Other values (1408) | 699095 | 69.9% |
| Value | Count | Frequency (%) |
| -1 | 282635 | 28.3% |
| 0 | 554 | 0.1% |
| 287 | 1 | < 0.1% |
| 287.25 | 45 | < 0.1% |
| 287.5 | 4 | < 0.1% |
| 287.75 | 14 | < 0.1% |
| 288 | 8 | < 0.1% |
| 288.25 | 10 | < 0.1% |
| 288.5 | 36 | < 0.1% |
| 288.75 | 15 | < 0.1% |
| Value | Count | Frequency (%) |
| 821 | 1 | < 0.1% |
| 803 | 1 | < 0.1% |
| 801 | 1 | < 0.1% |
| 797.5 | 1 | < 0.1% |
| 795.5 | 1 | < 0.1% |
| 788.5 | 2 | < 0.1% |
| 783.5 | 1 | < 0.1% |
| 781.5 | 3 | < 0.1% |
| 781 | 1 | < 0.1% |
| 780.5 | 1 | < 0.1% |
NU_NOTA_MT
Real number (ℝ)
High correlation 
| Distinct | 1608 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 364.94704 |
| Minimum | -1 |
|---|---|
| Maximum | 958.5 |
| Zeros | 4130 |
| Zeros (%) | 0.4% |
| Negative | 315826 |
| Negative (%) | 31.6% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | -1 |
| Q1 | -1 |
| median | 438.25 |
| Q3 | 579 |
| 95-th percentile | 729.5 |
| Maximum | 958.5 |
| Range | 959.5 |
| Interquartile range (IQR) | 580 |
Descriptive statistics
| Standard deviation | 271.35744 |
|---|---|
| Coefficient of variation (CV) | 0.74355291 |
| Kurtosis | -1.3437577 |
| Mean | 364.94704 |
| Median Absolute Deviation (MAD) | 185.75 |
| Skewness | -0.3100719 |
| Sum | 3.6494704 × 10⁸ |
| Variance | 73634.859 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -1 | 315826 | 31.6% |
| 0 | 4130 | 0.4% |
| 515.5 | 944 | 0.1% |
| 517 | 928 | 0.1% |
| 519 | 925 | 0.1% |
| 516.5 | 917 | 0.1% |
| 528 | 916 | 0.1% |
| 518 | 913 | 0.1% |
| 520.5 | 909 | 0.1% |
| 520 | 908 | 0.1% |
| Other values (1598) | 672684 | 67.3% |
| Value | Count | Frequency (%) |
| -1 | 315826 | 31.6% |
| 0 | 4130 | 0.4% |
| 319.75 | 1 | < 0.1% |
| 321 | 1 | < 0.1% |
| 322.75 | 2 | < 0.1% |
| 323.25 | 1 | < 0.1% |
| 324 | 1 | < 0.1% |
| 324.5 | 1 | < 0.1% |
| 325 | 2 | < 0.1% |
| 325.5 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 958.5 | 118 | < 0.1% |
| 948 | 7 | < 0.1% |
| 946.5 | 23 | < 0.1% |
| 945.5 | 8 | < 0.1% |
| 945 | 20 | < 0.1% |
| 943.5 | 84 | < 0.1% |
| 941 | 10 | < 0.1% |
| 940 | 2 | < 0.1% |
| 939 | 16 | < 0.1% |
| 938.5 | 1 | < 0.1% |
NU_NOTA_REDACAO
Real number (ℝ)
High correlation  Zeros 
| Distinct | 51 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 442.82207 |
| Minimum | -1 |
|---|---|
| Maximum | 1000 |
| Zeros | 29787 |
| Zeros (%) | 3.0% |
| Negative | 282635 |
| Negative (%) | 28.3% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | -1 |
| Q1 | -1 |
| median | 540 |
| Q3 | 700 |
| 95-th percentile | 920 |
| Maximum | 1000 |
| Range | 1001 |
| Interquartile range (IQR) | 701 |
Descriptive statistics
| Standard deviation | 332.50011 |
|---|---|
| Coefficient of variation (CV) | 0.75086617 |
| Kurtosis | -1.3830158 |
| Mean | 442.82207 |
| Median Absolute Deviation (MAD) | 240 |
| Skewness | -0.24522614 |
| Sum | 4.4282206 × 10⁸ |
| Variance | 110556.32 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -1 | 282635 | 28.3% |
| 560 | 38855 | 3.9% |
| 600 | 38715 | 3.9% |
| 580 | 32057 | 3.2% |
| 640 | 31463 | 3.1% |
| 520 | 30090 | 3.0% |
| 0 | 29787 | 3.0% |
| 540 | 26941 | 2.7% |
| 620 | 26794 | 2.7% |
| 680 | 26173 | 2.6% |
| Other values (41) | 436490 | 43.6% |
| Value | Count | Frequency (%) |
| -1 | 282635 | 28.3% |
| 0 | 29787 | 3.0% |
| 40 | 27 | < 0.1% |
| 60 | 19 | < 0.1% |
| 80 | 34 | < 0.1% |
| 100 | 21 | < 0.1% |
| 120 | 52 | < 0.1% |
| 140 | 49 | < 0.1% |
| 160 | 176 | < 0.1% |
| 180 | 212 | < 0.1% |
| Value | Count | Frequency (%) |
| 1000 | 13 | < 0.1% |
| 980 | 3195 | 0.3% |
| 960 | 11069 | 1.1% |
| 940 | 17361 | 1.7% |
| 920 | 22420 | 2.2% |
| 900 | 18706 | 1.9% |
| 880 | 22701 | 2.3% |
| 860 | 16514 | 1.7% |
| 840 | 21169 | 2.1% |
| 820 | 15783 | 1.6% |
TP_PRESENCA_GERAL
Categorical
High correlation 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| 1 | 680571 |
|---|---|
| 0 | 279032 |
| 2 | 36794 |
| 3 | 3603 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 680571 | 68.1% |
| 0 | 279032 | 27.9% |
| 2 | 36794 | 3.7% |
| 3 | 3603 | 0.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1 | 680571 | |
| 0 | 279032 | |
| 2 | 36794 | 3.7% |
| 3 | 3603 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 680571 | |
| 0 | 279032 | |
| 2 | 36794 | 3.7% |
| 3 | 3603 | 0.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1000000 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 1 | 680571 | |
| 0 | 279032 | |
| 2 | 36794 | 3.7% |
| 3 | 3603 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1000000 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 1 | 680571 | |
| 0 | 279032 | |
| 2 | 36794 | 3.7% |
| 3 | 3603 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1000000 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 1 | 680571 | |
| 0 | 279032 | |
| 2 | 36794 | 3.7% |
| 3 | 3603 | 0.4% |
TP_ANO_CONCLUIU
Real number (ℝ)
High correlation  Zeros 
| Distinct | 18 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.445723 |
| Minimum | 0 |
|---|---|
| Maximum | 17 |
| Zeros | 569476 |
| Zeros (%) | 56.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 3 |
| 95-th percentile | 15 |
| Maximum | 17 |
| Range | 17 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 4.446031 |
|---|---|
| Coefficient of variation (CV) | 1.81788 |
| Kurtosis | 3.6962453 |
| Mean | 2.445723 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.1454808 |
| Sum | 2445723 |
| Variance | 19.767192 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 569476 | 56.9% |
| 1 | 106613 | 10.7% |
| 2 | 67116 | 6.7% |
| 17 | 42343 | 4.2% |
| 3 | 38173 | 3.8% |
| 4 | 34853 | 3.5% |
| 5 | 26736 | 2.7% |
| 6 | 21764 | 2.2% |
| 7 | 16676 | 1.7% |
| 8 | 13963 | 1.4% |
| Other values (8) | 62287 | 6.2% |
| Value | Count | Frequency (%) |
| 0 | 569476 | 56.9% |
| 1 | 106613 | 10.7% |
| 2 | 67116 | 6.7% |
| 3 | 38173 | 3.8% |
| 4 | 34853 | 3.5% |
| 5 | 26736 | 2.7% |
| 6 | 21764 | 2.2% |
| 7 | 16676 | 1.7% |
| 8 | 13963 | 1.4% |
| 9 | 11897 | 1.2% |
| Value | Count | Frequency (%) |
| 17 | 42343 | 4.2% |
| 16 | 5164 | 0.5% |
| 15 | 5372 | 0.5% |
| 14 | 6256 | 0.6% |
| 13 | 6946 | 0.7% |
| 12 | 7298 | 0.7% |
| 11 | 9141 | 0.9% |
| 10 | 10213 | 1.0% |
| 9 | 11897 | 1.2% |
| 8 | 13963 | 1.4% |
NU_DESEMPENHO
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
| 2 | 454653 |
|---|---|
| 3 | 361569 |
| 1 | 183778 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 2 | 454653 | 45.5% |
| 3 | 361569 | 36.2% |
| 1 | 183778 | 18.4% |
NU_INFRAESTRUTURA
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.6 MiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 1 |
| 3rd row | 2 |
| 4th row | 2 |
| 5th row | 3 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 366709 | 36.7% |
| 2 | 343423 | 34.3% |
| 3 | 289868 | 29.0% |
NU_MEDIA_GERAL
Real number (ℝ)
High correlation 
| Distinct | 2939 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 378.64485 |
| Minimum | -1 |
| Maximum | 848 |
| Zeros | 4 |
| Zeros (%) | < 0.1% |
| Negative | 279349 |
| Negative (%) | 27.9% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | -1 |
| Q1 | -1 |
| median | 480 |
| Q3 | 572.5 |
| 95-th percentile | 681.5 |
| Maximum | 848 |
| Range | 849 |
| Interquartile range (IQR) | 573.5 |
Descriptive statistics
| Standard deviation | 254.75242 |
|---|---|
| Coefficient of variation (CV) | 0.67280043 |
| Kurtosis | -1.2429281 |
| Mean | 378.64485 |
| Median Absolute Deviation (MAD) | 127 |
| Skewness | -0.57788713 |
| Sum | 3.7864485 × 10⁸ |
| Variance | 64898.795 |
| Monotonicity | Not monotonic |
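In the Sample section of this report, every row with a presence flag of 0 carries -1 in all score columns and in NU_MEDIA_GERAL, so -1 appears to be a placeholder for "no score" rather than a real value. It accounts for 279032 rows (27.9%) and pulls the mean, the quartiles, and the IQR above toward zero. The following is a minimal sketch, assuming that reading of -1 is correct, of how the placeholder could be masked before computing statistics. The ten values are the first ten NU_MEDIA_GERAL entries from the Sample section; the variable names are illustrative.

```python
import numpy as np
import pandas as pd

# First ten NU_MEDIA_GERAL values from the Sample section of this report.
media = pd.Series([498.0, 543.0, 445.2, 586.0, 450.2, -1.0, 664.0, 546.0, 575.5, -1.0])

# Assumption: -1 marks "no score" and is not a real score, so treat it as missing.
scored = media.where(media != -1, np.nan)

print(media.mean())   # mean including the -1 placeholders
print(scored.mean())  # mean over rows that actually have a score
```

The report also lists a few hundred rows at -0.6 and -0.4. This sketch leaves them untouched, since their meaning cannot be determined from the report alone.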
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| -1 | 279032 | 27.9% |
| 522 | 1589 | 0.2% |
| 540 | 1586 | 0.2% |
| 536 | 1579 | 0.2% |
| 547 | 1570 | 0.2% |
| 533 | 1561 | 0.2% |
| 535 | 1559 | 0.2% |
| 541 | 1557 | 0.2% |
| 513 | 1551 | 0.2% |
| 517 | 1544 | 0.2% |
| Other values (2929) | 706872 | 70.7% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| -1 | 279032 | 27.9% |
| -0.6 | 72 | < 0.1% |
| -0.4 | 245 | < 0.1% |
| 0 | 4 | < 0.1% |
| 39.6 | 1 | < 0.1% |
| 47.6 | 1 | < 0.1% |
| 51.6 | 2 | < 0.1% |
| 55.6 | 2 | < 0.1% |
| 57.06 | 2 | < 0.1% |
| 57.2 | 1 | < 0.1% |
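Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 848 | 2 | < 0.1% |
| 847 | 1 | < 0.1% |
| 844.5 | 1 | < 0.1% |
| 843.5 | 1 | < 0.1% |
| 842 | 1 | < 0.1% |
| 840.5 | 2 | < 0.1% |
| 839.5 | 1 | < 0.1% |
| 837 | 1 | < 0.1% |
| 836.5 | 1 | < 0.1% |
| 832 | 2 | < 0.1% |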
Q001
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 977.0 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | C |
|---|---|
| 2nd row | F |
| 3rd row | D |
| 4th row | B |
| 5th row | B |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| E | 283473 | 28.3% |
| B | 178231 | 17.8% |
| C | 129969 | 13.0% |
| D | 111234 | 11.1% |
| H | 102520 | 10.3% |
| F | 84967 | 8.5% |
| G | 65278 | 6.5% |
| A | 44328 | 4.4% |
Q002
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 977.0 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | C |
|---|---|
| 2nd row | F |
| 3rd row | C |
| 4th row | B |
| 5th row | B |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| E | 350191 | 35.0% |
| B | 128472 | 12.8% |
| D | 120093 | 12.0% |
| F | 116429 | 11.6% |
| G | 112447 | 11.2% |
| C | 110653 | 11.1% |
| H | 33453 | 3.3% |
| A | 28262 | 2.8% |
Q005
Real number (ℝ)
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.68914 |
| Minimum | 1 |
| Maximum | 20 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.6 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 4 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 20 |
| Range | 19 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.4090862 |
|---|---|
| Coefficient of variation (CV) | 0.38195521 |
| Kurtosis | 5.8725695 |
| Mean | 3.68914 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.1397313 |
| Sum | 3689140 |
| Variance | 1.985524 |
| Monotonicity | Not monotonic |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 4 | 317410 | 31.7% |
| 3 | 275886 | 27.6% |
| 2 | 144434 | 14.4% |
| 5 | 144060 | 14.4% |
| 6 | 49453 | 4.9% |
| 1 | 37149 | 3.7% |
| 7 | 18083 | 1.8% |
| 8 | 7479 | 0.7% |
| 9 | 2807 | 0.3% |
| 10 | 1720 | 0.2% |
| Other values (10) | 1519 | 0.2% |
Minimum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 37149 | 3.7% |
| 2 | 144434 | 14.4% |
| 3 | 275886 | 27.6% |
| 4 | 317410 | 31.7% |
| 5 | 144060 | 14.4% |
| 6 | 49453 | 4.9% |
| 7 | 18083 | 1.8% |
| 8 | 7479 | 0.7% |
| 9 | 2807 | 0.3% |
| 10 | 1720 | 0.2% |
Maximum 10 values
| Value | Count | Frequency (%) |
|---|---|---|
| 20 | 135 | < 0.1% |
| 19 | 13 | < 0.1% |
| 18 | 24 | < 0.1% |
| 17 | 25 | < 0.1% |
| 16 | 35 | < 0.1% |
| 15 | 101 | < 0.1% |
| 14 | 101 | < 0.1% |
| 13 | 165 | < 0.1% |
| 12 | 373 | < 0.1% |
| 11 | 547 | 0.1% |
Q006
Categorical
High correlation 
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 977.4 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | B |
|---|---|
| 2nd row | D |
| 3rd row | B |
| 4th row | C |
| 5th row | B |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| B | 315588 | 31.6% |
| C | 166128 | 16.6% |
| D | 111331 | 11.1% |
| E | 74994 | 7.5% |
| A | 68034 | 6.8% |
| G | 66038 | 6.6% |
| F | 43775 | 4.4% |
| H | 35309 | 3.5% |
| I | 21953 | 2.2% |
| J | 19391 | 1.9% |
| Other values (7) | 77459 | 7.7% |
Q025
Categorical
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 976.8 KiB |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | B |
|---|---|
| 2nd row | B |
| 3rd row | B |
| 4th row | B |
| 5th row | B |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| B | 904861 | 90.5% |
| A | 95139 | 9.5% |
Interactions
Correlations
| | NU_DESEMPENHO | NU_INFRAESTRUTURA | NU_MEDIA_GERAL | NU_NOTA_CH | NU_NOTA_CN | NU_NOTA_LC | NU_NOTA_MT | NU_NOTA_REDACAO | Q001 | Q002 | Q005 | Q006 | Q025 | SG_UF_PROVA | TP_ANO_CONCLUIU | TP_COR_RACA | TP_DEPENDENCIA_ADM_ESC | TP_ESTADO_CIVIL | TP_FAIXA_ETARIA | TP_PRESENCA_CH | TP_PRESENCA_CN | TP_PRESENCA_GERAL | TP_PRESENCA_LC | TP_PRESENCA_MT | TP_SEXO | TP_ST_CONCLUSAO |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| NU_DESEMPENHO | 1.000 | 0.219 | 0.953 | 0.756 | 0.781 | 0.741 | 0.819 | 0.776 | 0.242 | 0.240 | 0.085 | 0.277 | 0.126 | 0.101 | 0.122 | 0.147 | 0.197 | 0.088 | 0.192 | 0.590 | 0.635 | 0.641 | 0.590 | 0.635 | 0.036 | 0.130 |
| NU_INFRAESTRUTURA | 0.219 | 1.000 | 0.241 | 0.212 | 0.216 | 0.215 | 0.251 | 0.191 | 0.319 | 0.303 | 0.170 | 0.528 | 0.423 | 0.278 | 0.121 | 0.219 | 0.172 | 0.059 | 0.198 | 0.114 | 0.113 | 0.116 | 0.114 | 0.113 | 0.099 | 0.150 |
| NU_MEDIA_GERAL | 0.953 | 0.241 | 1.000 | 0.918 | 0.913 | 0.910 | 0.936 | 0.924 | 0.143 | 0.140 | 0.017 | 0.149 | 0.149 | 0.062 | -0.143 | -0.168 | 0.151 | 0.070 | -0.282 | 0.702 | 0.681 | 0.750 | 0.702 | 0.681 | 0.050 | 0.146 |
| NU_NOTA_CH | 0.756 | 0.212 | 0.918 | 1.000 | 0.837 | 0.918 | 0.837 | 0.816 | 0.121 | 0.119 | 0.001 | 0.145 | 0.135 | 0.074 | -0.107 | -0.162 | 0.123 | 0.063 | -0.228 | 0.705 | 0.642 | 0.579 | 0.705 | 0.642 | 0.067 | 0.147 |
| NU_NOTA_CN | 0.781 | 0.216 | 0.913 | 0.837 | 1.000 | 0.827 | 0.882 | 0.774 | 0.126 | 0.122 | 0.007 | 0.154 | 0.130 | 0.069 | -0.109 | -0.151 | 0.131 | 0.060 | -0.224 | 0.635 | 0.700 | 0.572 | 0.635 | 0.700 | 0.119 | 0.133 |
| NU_NOTA_LC | 0.741 | 0.215 | 0.910 | 0.918 | 0.827 | 1.000 | 0.829 | 0.814 | 0.122 | 0.121 | 0.003 | 0.144 | 0.142 | 0.078 | -0.122 | -0.167 | 0.117 | 0.065 | -0.249 | 0.706 | 0.643 | 0.580 | 0.706 | 0.643 | 0.044 | 0.143 |
| NU_NOTA_MT | 0.819 | 0.251 | 0.936 | 0.837 | 0.882 | 0.829 | 1.000 | 0.798 | 0.144 | 0.139 | 0.022 | 0.177 | 0.150 | 0.080 | -0.148 | -0.163 | 0.143 | 0.065 | -0.280 | 0.635 | 0.700 | 0.572 | 0.635 | 0.700 | 0.142 | 0.128 |
| NU_NOTA_REDACAO | 0.776 | 0.191 | 0.924 | 0.816 | 0.774 | 0.814 | 0.798 | 1.000 | 0.118 | 0.122 | 0.037 | 0.112 | 0.121 | 0.043 | -0.178 | -0.140 | 0.134 | 0.077 | -0.317 | 0.658 | 0.614 | 0.542 | 0.658 | 0.614 | 0.085 | 0.129 |
| Q001 | 0.242 | 0.319 | 0.143 | 0.121 | 0.126 | 0.122 | 0.144 | 0.118 | 1.000 | 0.341 | 0.049 | 0.238 | 0.194 | 0.088 | 0.078 | 0.115 | 0.137 | 0.080 | 0.126 | 0.123 | 0.123 | 0.103 | 0.123 | 0.123 | 0.069 | 0.134 |
| Q002 | 0.240 | 0.303 | 0.140 | 0.119 | 0.122 | 0.121 | 0.139 | 0.122 | 0.341 | 1.000 | 0.050 | 0.213 | 0.194 | 0.080 | 0.089 | 0.107 | 0.132 | 0.100 | 0.149 | 0.133 | 0.134 | 0.112 | 0.133 | 0.134 | 0.072 | 0.143 |
| Q005 | 0.085 | 0.170 | 0.017 | 0.001 | 0.007 | 0.003 | 0.022 | 0.037 | 0.049 | 0.050 | 1.000 | 0.063 | 0.091 | 0.056 | -0.166 | 0.055 | 0.059 | 0.051 | -0.182 | 0.071 | 0.071 | 0.060 | 0.071 | 0.071 | 0.018 | 0.113 |
| Q006 | 0.277 | 0.528 | 0.149 | 0.145 | 0.154 | 0.144 | 0.177 | 0.112 | 0.238 | 0.213 | 0.063 | 1.000 | 0.322 | 0.102 | 0.043 | 0.147 | 0.166 | 0.026 | 0.082 | 0.114 | 0.115 | 0.096 | 0.114 | 0.115 | 0.109 | 0.120 |
| Q025 | 0.126 | 0.423 | 0.149 | 0.135 | 0.130 | 0.142 | 0.150 | 0.121 | 0.194 | 0.194 | 0.091 | 0.322 | 1.000 | 0.214 | 0.036 | 0.137 | 0.079 | 0.013 | 0.083 | 0.062 | 0.061 | 0.063 | 0.062 | 0.061 | 0.039 | 0.046 |
| SG_UF_PROVA | 0.101 | 0.278 | 0.062 | 0.074 | 0.069 | 0.078 | 0.080 | 0.043 | 0.088 | 0.080 | 0.056 | 0.102 | 0.214 | 1.000 | 0.046 | 0.176 | 0.119 | 0.034 | 0.064 | 0.048 | 0.050 | 0.046 | 0.048 | 0.050 | 0.039 | 0.102 |
| TP_ANO_CONCLUIU | 0.122 | 0.121 | -0.143 | -0.107 | -0.109 | -0.122 | -0.148 | -0.178 | 0.078 | 0.089 | -0.166 | 0.043 | 0.036 | 0.046 | 1.000 | 0.063 | 0.196 | 0.216 | 0.746 | 0.153 | 0.141 | 0.126 | 0.153 | 0.141 | 0.013 | 0.414 |
| TP_COR_RACA | 0.147 | 0.219 | -0.168 | -0.162 | -0.151 | -0.167 | -0.163 | -0.140 | 0.115 | 0.107 | 0.055 | 0.147 | 0.137 | 0.176 | 0.063 | 1.000 | 0.074 | 0.042 | 0.119 | 0.066 | 0.065 | 0.055 | 0.066 | 0.065 | 0.019 | 0.072 |
| TP_DEPENDENCIA_ADM_ESC | 0.197 | 0.172 | 0.151 | 0.123 | 0.131 | 0.117 | 0.143 | 0.134 | 0.137 | 0.132 | 0.059 | 0.166 | 0.079 | 0.119 | 0.196 | 0.074 | 1.000 | 0.058 | 0.199 | 0.100 | 0.105 | 0.086 | 0.100 | 0.105 | 0.075 | 0.441 |
| TP_ESTADO_CIVIL | 0.088 | 0.059 | 0.070 | 0.063 | 0.060 | 0.065 | 0.065 | 0.077 | 0.080 | 0.100 | 0.051 | 0.026 | 0.013 | 0.034 | 0.216 | 0.042 | 0.058 | 1.000 | 0.271 | 0.088 | 0.084 | 0.073 | 0.088 | 0.084 | 0.017 | 0.124 |
| TP_FAIXA_ETARIA | 0.192 | 0.198 | -0.282 | -0.228 | -0.224 | -0.249 | -0.280 | -0.317 | 0.126 | 0.149 | -0.182 | 0.082 | 0.083 | 0.064 | 0.746 | 0.119 | 0.199 | 0.271 | 1.000 | 0.210 | 0.199 | 0.175 | 0.210 | 0.199 | 0.028 | 0.522 |
| TP_PRESENCA_CH | 0.590 | 0.114 | 0.702 | 0.705 | 0.635 | 0.706 | 0.635 | 0.658 | 0.123 | 0.133 | 0.071 | 0.114 | 0.062 | 0.048 | 0.153 | 0.066 | 0.100 | 0.088 | 0.210 | 1.000 | 0.642 | 0.709 | 1.000 | 0.642 | 0.010 | 0.167 |
| TP_PRESENCA_CN | 0.635 | 0.113 | 0.681 | 0.642 | 0.700 | 0.643 | 0.700 | 0.614 | 0.123 | 0.134 | 0.071 | 0.115 | 0.061 | 0.050 | 0.141 | 0.065 | 0.105 | 0.084 | 0.199 | 0.642 | 1.000 | 0.712 | 0.642 | 1.000 | 0.007 | 0.152 |
| TP_PRESENCA_GERAL | 0.641 | 0.116 | 0.750 | 0.579 | 0.572 | 0.580 | 0.572 | 0.542 | 0.103 | 0.112 | 0.060 | 0.096 | 0.063 | 0.046 | 0.126 | 0.055 | 0.086 | 0.073 | 0.175 | 0.709 | 0.712 | 1.000 | 0.709 | 0.712 | 0.012 | 0.142 |
| TP_PRESENCA_LC | 0.590 | 0.114 | 0.702 | 0.705 | 0.635 | 0.706 | 0.635 | 0.658 | 0.123 | 0.133 | 0.071 | 0.114 | 0.062 | 0.048 | 0.153 | 0.066 | 0.100 | 0.088 | 0.210 | 1.000 | 0.642 | 0.709 | 1.000 | 0.642 | 0.010 | 0.167 |
| TP_PRESENCA_MT | 0.635 | 0.113 | 0.681 | 0.642 | 0.700 | 0.643 | 0.700 | 0.614 | 0.123 | 0.134 | 0.071 | 0.115 | 0.061 | 0.050 | 0.141 | 0.065 | 0.105 | 0.084 | 0.199 | 0.642 | 1.000 | 0.712 | 0.642 | 1.000 | 0.007 | 0.152 |
| TP_SEXO | 0.036 | 0.099 | 0.050 | 0.067 | 0.119 | 0.044 | 0.142 | 0.085 | 0.069 | 0.072 | 0.018 | 0.109 | 0.039 | 0.039 | 0.013 | 0.019 | 0.075 | 0.017 | 0.028 | 0.010 | 0.007 | 0.012 | 0.010 | 0.007 | 1.000 | 0.047 |
| TP_ST_CONCLUSAO | 0.130 | 0.150 | 0.146 | 0.147 | 0.133 | 0.143 | 0.128 | 0.129 | 0.134 | 0.143 | 0.113 | 0.120 | 0.046 | 0.102 | 0.414 | 0.072 | 0.441 | 0.124 | 0.522 | 0.167 | 0.152 | 0.142 | 0.167 | 0.152 | 0.047 | 1.000 |
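A matrix of this size is easier to read once the strong pairs are pulled out. The following is a minimal sketch that scans the upper triangle of a correlation matrix and lists every pair at or above a threshold. The 4×4 excerpt is copied from the table above; the function name `strong_pairs` and the 0.5 cutoff are assumptions made for illustration (the weakest pair flagged as "High correlation" in the overview is 0.522, while 0.423 is not flagged, which is consistent with a cutoff near 0.5, but the report does not state its threshold).

```python
import pandas as pd

# A 4x4 excerpt of the correlation matrix above (values copied from the table).
names = ["NU_INFRAESTRUTURA", "Q006", "Q025", "TP_SEXO"]
corr = pd.DataFrame(
    [
        [1.000, 0.528, 0.423, 0.099],
        [0.528, 1.000, 0.322, 0.109],
        [0.423, 0.322, 1.000, 0.039],
        [0.099, 0.109, 0.039, 1.000],
    ],
    index=names,
    columns=names,
)


def strong_pairs(corr, threshold=0.5):
    """Return (a, b, value) for each unordered pair with |value| >= threshold."""
    pairs = []
    cols = list(corr.columns)
    for i, a in enumerate(cols):
        # Only look to the right of the diagonal so each pair appears once.
        for b in cols[i + 1:]:
            value = float(corr.loc[a, b])
            if abs(value) >= threshold:
                pairs.append((a, b, value))
    return pairs


print(strong_pairs(corr))
```

On this excerpt only the NU_INFRAESTRUTURA and Q006 pair clears the cutoff, matching the alert in the overview.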
Missing values
Sample
First rows
| | TP_FAIXA_ETARIA | TP_SEXO | TP_ESTADO_CIVIL | TP_COR_RACA | TP_DEPENDENCIA_ADM_ESC | TP_ST_CONCLUSAO | SG_UF_PROVA | TP_PRESENCA_CN | TP_PRESENCA_CH | TP_PRESENCA_LC | TP_PRESENCA_MT | NU_NOTA_CN | NU_NOTA_CH | NU_NOTA_LC | NU_NOTA_MT | NU_NOTA_REDACAO | TP_PRESENCA_GERAL | TP_ANO_CONCLUIU | NU_DESEMPENHO | NU_INFRAESTRUTURA | NU_MEDIA_GERAL | Q001 | Q002 | Q005 | Q006 | Q025 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 3 | F | 1 | 1 | 2.0 | 2 | RJ | 1 | 1 | 1 | 1 | 409.00 | 527.00 | 418.00 | 416.50 | 720 | 1 | 0 | 2 | 3 | 498.0 | C | C | 3 | B | B |
| 1 | 2 | F | 1 | 1 | -1.0 | 3 | SP | 1 | 1 | 1 | 1 | 499.50 | 535.50 | 549.00 | 570.50 | 560 | 1 | 0 | 2 | 1 | 543.0 | F | F | 3 | D | B |
| 2 | 8 | M | 1 | 2 | -1.0 | 1 | MA | 1 | 1 | 1 | 1 | 425.25 | 391.75 | 446.00 | 503.50 | 460 | 1 | 0 | 2 | 2 | 445.2 | D | C | 2 | B | B |
| 3 | 12 | M | 1 | 1 | -1.0 | 1 | PE | 1 | 1 | 1 | 1 | 621.00 | 584.50 | 493.00 | 412.75 | 820 | 1 | 16 | 2 | 2 | 586.0 | B | B | 4 | C | B |
| 4 | 2 | F | 1 | 1 | 2.0 | 2 | PR | 1 | 1 | 1 | 1 | 445.00 | 458.00 | 457.25 | 491.50 | 400 | 1 | 0 | 2 | 3 | 450.2 | B | B | 4 | B | B |
| 5 | 3 | F | 1 | 3 | -1.0 | 2 | BA | 0 | 0 | 0 | 0 | -1.00 | -1.00 | -1.00 | -1.00 | -1 | 0 | 0 | 3 | 3 | -1.0 | C | E | 2 | C | B |
| 6 | 11 | M | 2 | 3 | -1.0 | 1 | SP | 1 | 1 | 1 | 1 | 617.50 | 673.00 | 705.00 | 724.50 | 600 | 1 | 11 | 1 | 2 | 664.0 | C | C | 4 | E | B |
| 7 | 2 | M | 1 | 3 | 3.0 | 2 | SP | 1 | 1 | 1 | 1 | 407.25 | 583.00 | 479.50 | 601.50 | 660 | 1 | 0 | 2 | 1 | 546.0 | D | E | 3 | E | B |
| 8 | 12 | F | 0 | 2 | -1.0 | 1 | MS | 1 | 1 | 1 | 1 | 518.00 | 630.00 | 611.00 | 538.00 | 580 | 1 | 7 | 2 | 1 | 575.5 | H | E | 4 | C | B |
| 9 | 11 | F | 1 | 3 | -1.0 | 1 | SP | 0 | 0 | 0 | 0 | -1.00 | -1.00 | -1.00 | -1.00 | -1 | 0 | 11 | 3 | 2 | -1.0 | B | C | 3 | G | B |
Last rows
| | TP_FAIXA_ETARIA | TP_SEXO | TP_ESTADO_CIVIL | TP_COR_RACA | TP_DEPENDENCIA_ADM_ESC | TP_ST_CONCLUSAO | SG_UF_PROVA | TP_PRESENCA_CN | TP_PRESENCA_CH | TP_PRESENCA_LC | TP_PRESENCA_MT | NU_NOTA_CN | NU_NOTA_CH | NU_NOTA_LC | NU_NOTA_MT | NU_NOTA_REDACAO | TP_PRESENCA_GERAL | TP_ANO_CONCLUIU | NU_DESEMPENHO | NU_INFRAESTRUTURA | NU_MEDIA_GERAL | Q001 | Q002 | Q005 | Q006 | Q025 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 999990 | 1 | M | 1 | 3 | -1.0 | 3 | GO | 1 | 1 | 1 | 1 | 549.00 | 563.00 | 589.50 | 726.50 | 840 | 1 | 0 | 1 | 1 | 653.5 | F | G | 4 | H | B |
| 999991 | 1 | F | 1 | 3 | -1.0 | 3 | PA | 0 | 0 | 0 | 0 | -1.00 | -1.00 | -1.00 | -1.00 | -1 | 0 | 0 | 3 | 3 | -1.0 | D | E | 3 | A | B |
| 999992 | 2 | M | 1 | 3 | -1.0 | 3 | PE | 1 | 1 | 1 | 1 | 491.50 | 512.50 | 584.50 | 643.50 | 560 | 1 | 0 | 2 | 2 | 558.5 | C | E | 5 | D | B |
| 999993 | 2 | F | 1 | 1 | -1.0 | 3 | BA | 1 | 1 | 1 | 1 | 479.50 | 548.50 | 533.50 | 420.25 | 620 | 1 | 0 | 2 | 2 | 520.5 | E | E | 5 | C | B |
| 999994 | 2 | F | 1 | 0 | -1.0 | 3 | SP | 0 | 0 | 0 | 0 | -1.00 | -1.00 | -1.00 | -1.00 | -1 | 0 | 0 | 3 | 2 | -1.0 | D | E | 6 | B | B |
| 999995 | 2 | M | 1 | 4 | -1.0 | 3 | PI | 0 | 0 | 0 | 0 | -1.00 | -1.00 | -1.00 | -1.00 | -1 | 0 | 0 | 3 | 2 | -1.0 | H | E | 4 | B | B |
| 999996 | 11 | M | 3 | 3 | -1.0 | 1 | PA | 1 | 1 | 1 | 1 | 497.25 | 334.25 | 480.75 | 441.75 | 540 | 1 | 0 | 2 | 3 | 458.8 | D | G | 3 | B | B |
| 999997 | 3 | F | 1 | 2 | -1.0 | 1 | PR | 0 | 0 | 0 | 0 | -1.00 | -1.00 | -1.00 | -1.00 | -1 | 0 | 1 | 3 | 1 | -1.0 | G | C | 4 | H | B |
| 999998 | 4 | M | 1 | 3 | -1.0 | 1 | AP | 1 | 1 | 1 | 1 | 439.00 | 390.25 | 454.25 | 423.75 | 0 | 1 | 1 | 3 | 1 | 341.5 | E | B | 6 | G | B |
| 999999 | 3 | F | 1 | 1 | -1.0 | 1 | SP | 1 | 1 | 1 | 1 | 486.25 | 510.50 | 460.50 | 519.50 | 520 | 1 | 1 | 2 | 2 | 499.2 | D | E | 2 | E | B |
Duplicate rows
Most frequently occurring
| | TP_FAIXA_ETARIA | TP_SEXO | TP_ESTADO_CIVIL | TP_COR_RACA | TP_DEPENDENCIA_ADM_ESC | TP_ST_CONCLUSAO | SG_UF_PROVA | TP_PRESENCA_CN | TP_PRESENCA_CH | TP_PRESENCA_LC | TP_PRESENCA_MT | NU_NOTA_CN | NU_NOTA_CH | NU_NOTA_LC | NU_NOTA_MT | NU_NOTA_REDACAO | TP_PRESENCA_GERAL | TP_ANO_CONCLUIU | NU_DESEMPENHO | NU_INFRAESTRUTURA | NU_MEDIA_GERAL | Q001 | Q002 | Q005 | Q006 | Q025 | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4903 | 3 | M | 1 | 3 | 2.0 | 2 | CE | 0 | 0 | 0 | 0 | -1.0 | -1.0 | -1.0 | -1.0 | -1 | 0 | 0 | 3 | 3 | -1.0 | H | H | 4 | B | B | 30 |
| 3537 | 3 | F | 1 | 3 | 2.0 | 2 | CE | 0 | 0 | 0 | 0 | -1.0 | -1.0 | -1.0 | -1.0 | -1 | 0 | 0 | 3 | 3 | -1.0 | H | H | 3 | B | B | 24 |
| 4898 | 3 | M | 1 | 3 | 2.0 | 2 | CE | 0 | 0 | 0 | 0 | -1.0 | -1.0 | -1.0 | -1.0 | -1 | 0 | 0 | 3 | 3 | -1.0 | H | H | 3 | B | B | 22 |
| 2542 | 3 | F | 1 | 1 | 2.0 | 2 | SP | 0 | 0 | 0 | 0 | -1.0 | -1.0 | -1.0 | -1.0 | -1 | 0 | 0 | 3 | 1 | -1.0 | E | E | 4 | D | B | 20 |
| 3543 | 3 | F | 1 | 3 | 2.0 | 2 | CE | 0 | 0 | 0 | 0 | -1.0 | -1.0 | -1.0 | -1.0 | -1 | 0 | 0 | 3 | 3 | -1.0 | H | H | 4 | B | B | 20 |
| 4851 | 3 | M | 1 | 3 | 2.0 | 2 | CE | 0 | 0 | 0 | 0 | -1.0 | -1.0 | -1.0 | -1.0 | -1 | 0 | 0 | 3 | 3 | -1.0 | B | B | 4 | B | B | 20 |
| 1710 | 2 | M | 1 | 3 | 2.0 | 2 | CE | 0 | 0 | 0 | 0 | -1.0 | -1.0 | -1.0 | -1.0 | -1 | 0 | 0 | 3 | 3 | -1.0 | H | H | 3 | B | B | 19 |
| 4902 | 3 | M | 1 | 3 | 2.0 | 2 | CE | 0 | 0 | 0 | 0 | -1.0 | -1.0 | -1.0 | -1.0 | -1 | 0 | 0 | 3 | 3 | -1.0 | H | H | 4 | B | A | 18 |
| 4849 | 3 | M | 1 | 3 | 2.0 | 2 | CE | 0 | 0 | 0 | 0 | -1.0 | -1.0 | -1.0 | -1.0 | -1 | 0 | 0 | 3 | 3 | -1.0 | B | B | 3 | B | B | 16 |
| 4866 | 3 | M | 1 | 3 | 2.0 | 2 | CE | 0 | 0 | 0 | 0 | -1.0 | -1.0 | -1.0 | -1.0 | -1 | 0 | 0 | 3 | 3 | -1.0 | C | C | 4 | B | A | 16 |
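A table like the one above can be approximated with a group-and-count over all columns. The following is a minimal sketch on a hypothetical three-column frame: the column names are borrowed from this report, but the values are made up for illustration, and how the report arrives at its own figure of 9569 duplicate rows is not stated here.

```python
import pandas as pd

# Hypothetical toy frame; column names come from the report, values are invented.
df = pd.DataFrame(
    {
        "SG_UF_PROVA": ["CE", "CE", "SP", "CE", "SP"],
        "Q001": ["H", "H", "E", "H", "B"],
        "Q005": [4, 4, 4, 4, 3],
    }
)

# Size of every group of identical rows; keep only groups that occur more than once.
dup_counts = (
    df.groupby(list(df.columns), dropna=False)
    .size()
    .reset_index(name="# duplicates")
)
dup_counts = dup_counts[dup_counts["# duplicates"] > 1].sort_values(
    "# duplicates", ascending=False
)

# Rows that repeat an earlier row (the first occurrence is not counted).
n_duplicate_rows = int(df.duplicated().sum())

print(dup_counts)
print(n_duplicate_rows)
```

Nearly all of the most frequent duplicates listed above are absent candidates (presence flags 0, scores -1), which is expected: once the score columns collapse to a single placeholder, only the categorical fields can distinguish two rows.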